(54) Audio signal processor comprising a means for embedding an audible watermark in an audio 
signal, audio player comprising a means for removing the audible watermark and audio 
distribution system and method using the audio signal processor and the audio player 



(57) An audio signal processor, an audio player, an audio
distribution system and a method thereof which permit sample
playback, are applicable to uncompressed audio contents, and
allow the sound quality to be controlled per frequency band.

Provided are an audio signal processor having embedding means
for embedding in the audio signal a watermark that is audible to
the human sense of hearing when the audio signal is played back,
an audio player having removing means for removing, using a
specific key, the watermark embedded in the audio signal, and an
audio distribution system and a method thereof in which those
apparatuses are utilized.
Description 

[0001] The present invention relates to an audio signal
processor, an audio player, an audio distribution system and a
method thereof. More particularly, the present invention
concerns an audio signal processor, an audio player, an audio
distribution system and a method thereof in which the copyright
of audio contents is protected using a watermark technique.

[0002] With the rapid spread of the Internet and progress in
audio compression technologies exemplified by MP3 (MPEG Audio
Layer-3) and AAC (Advanced Audio Coding) in recent years, audio
distribution systems are becoming popular. In an audio
distribution system, audio contents, namely audio signals, are
compressed and electronically distributed via a network (the
Internet) to consumers, who download the audio contents and play
them back in a corresponding audio player.
[0003] Unlike the sale of audio contents at CD shops, the
distribution (sale) of audio contents by the audio distribution
system needs no salesroom space. Therefore, audio contents by
little-known artists, for which large sales are not expected,
can also be offered to consumers without difficulty.
Furthermore, in an environment where a network is accessible,
the consumer can obtain those audio contents at any time.

[0004] The prior art music distribution apparatus and 
audio player will be explained with reference to FIG. 9. 
[0005] In a first prior art, a distribution apparatus 901 is
provided with an audio signal processor 902 and a distributor
903. The audio signal processor 902 comprises a compressor 904
for compressing audio signals and an encryptor 905 for
encrypting the compressed audio signals using a first key 906.
The audio signals processed by the compressor 904 and the
encryptor 905, that is, audio signal distribution data, are
stored in the distributor 903 and are sent via a network 907 to
a consumer, for example on request from an audio player 908
that plays back the audio signal data.

[0006] For the consumer to play back and listen to the audio
signal distribution data thus distributed, a decoder 909 in the
audio player 908 decodes (decrypts) the audio signal
distribution data using a second key 910. Furthermore, a
decompressor 911 in the audio player 908 decompresses the
decoded (decrypted) audio signal distribution data, and audio
playback means 912 plays back the data.

[0007] The second key 910 used by the decoder 909 is
information that the consumer can obtain via the network 907 or
the like by paying a fee through electronic settlement of
accounts. In other words, a consumer who has not purchased the
second key 910 cannot decode the audio signal distribution data
and cannot play back the audio contents. Therefore, a contents
provider which distributes the audio signal distribution data
can prevent audio contents from being illegally copied, whereby
the copyrights can be protected.

[0008] Next, a second prior art is disclosed in Audio
Engineering Society Preprint 5100 ("Secure Delivery of
Compressed Audio by Compatible Bitstream Scrambling," Eric
Allamanche et al., Fraunhofer Institute for Integrated
Circuits). This is a technique for encrypting the audio
contents, for example by the encryptor 905 in the first prior
art, in which only part of the already compressed audio signals
is encrypted. This encryption method involves encrypting the
least significant bits of the audio data quantized per frequency
band, or rearranging some values of the spectral coefficients
according to a specific rule. Specifically, for example, the
distribution apparatus 901 encrypts the lower-order bits of the
spectral coefficients in the compressed audio data by taking the
exclusive OR with a key of the same number of bits, and the
audio player 908 decodes the lower-order bits. This audibly
degrades the sound quality of the audio contents that are sent.
Meanwhile, if the consumer buys a second key and inputs it into
the audio player 908, the partly encrypted audio data will be
decoded, and audio contents of high sound quality can be played
back.
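
The exclusive-OR scrambling of lower-order bits described above
can be illustrated with a minimal sketch. It is not taken from
the cited preprint: the function name xor_low_bits, the use of
plain non-negative integers as stand-ins for quantized spectral
coefficients, and the choice of two scrambled bits are all
assumptions made for illustration only.

```python
def xor_low_bits(coeffs, key, n_bits=2):
    """XOR the n_bits lowest-order bits of each coefficient with key bits.

    coeffs: non-negative integers standing in for quantized spectral
            coefficients (hypothetical representation).
    key:    non-empty sequence of integers; only their low n_bits are used.
    """
    mask = (1 << n_bits) - 1
    return [c ^ (key[i % len(key)] & mask) for i, c in enumerate(coeffs)]


coeffs = [52, 17, 200, 9]
key = [3, 1, 2]
scrambled = xor_low_bits(coeffs, key)    # degraded data that would be distributed
restored = xor_low_bits(scrambled, key)  # applying the same key undoes the XOR
```

Because XOR is its own inverse, the same key both scrambles and
restores the data, and the higher-order bits are never touched,
which is why the scrambled data remains playable at reduced
quality.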

[0009] This way, the contents provider can distribute low sound
quality audio contents to consumers as samples to promote sales.
Before deciding whether to buy the key, the consumer can perform
sample playback of the audio contents, though the contents are
low in sound quality.

[0010] However, the prior art music distribution apparatus and
audio player present the following problems.
[0011] First, in the first prior art technique, the second key
910 has to be bought to reproduce the encrypted audio contents.
In other words, all the audio signals forming the distributed
audio contents are encrypted, so the original audio signals are
not retained at all and are impossible to play back or, if
played back at all, yield a collection of noises quite different
from the original audio contents. Therefore, the consumer cannot
perform sample playback of the audio contents before paying for
the second key 910.

[0012] Next, the second prior art is a technique for compressed
audio contents and cannot control the sample playback of
contents which are not compressed. Therefore, this technique
cannot be applied in case audio contents are sent with the high
sound quality of a music CD (compact disc) retained.
[0013] Furthermore, in the second prior art technique, in which
the least significant bits of the spectral coefficients of
compressed audio data are exclusive-ORed with a key of the same
number of bits, the same key has to be used for encryption and
decoding. Therefore, the problem is that if the key is disclosed
by a malicious third party, consumers can play back high quality
audio contents using the disclosed key. The method in which the
lower-order bits are exchanged with each other also has a
problem: it is difficult to predict quantitatively to what
extent the sound quality will degrade, and some kinds of audio
data do not degrade much acoustically. Therefore, audio contents
processed by this method have to be checked to confirm that the
processing is effective enough.

[0014] Accordingly, it is an object of the present invention to
provide an audio signal processor, an audio player, an audio
distribution system and a method thereof wherein audio contents
that permit sample playback can be prepared, which are
applicable to uncompressed audio contents, and which furthermore
make it possible to control sound quality per frequency band.
[0015] To achieve the foregoing object, the present in- 
vention is provided with the following means. 
[0016] That is, the present invention presupposes processing of
the audio signal to change it into a format distributable
through a network. Here, there is provided embedding means for
embedding in the audio signal a watermark whose signal level is
audible to the human sense of hearing when the audio signal is
played back. It is to be understood that audio signals include
music, sound and voice signals.

[0017] Thus, the audio signal has embedded in it a watermark
that can be perceived by the human sense of hearing. With the
watermark acting as noise or the like, the audio signal
containing the noise can be used for sample playback. Needless
to say, if the embedded watermark is removed, the audio signal
can be played back as high quality sound. It is thus possible to
provide sample audio content and high quality audio content in
one and the same signal.

[0018] Furthermore, other arrangements are possi- 
ble. In one of the other arrangements, the compressor 
for compressing the watermark embedded audio signal 
in a specific method is provided after the embedding 
means. In another arrangement, the compressor is pro- 
vided before the embedding means. 
[0019] Under those arrangements, even such cases 
where the supplier of the audio signal and the contents 
provider of the audio signal are different can be dealt 
with flexibly. In addition, the volume of processing for 
embedding the watermark can be reduced. 
[0020] Furthermore, the system can be so arranged 
that the embedding means inputs the audio signal alone 
and generates a watermark on the basis of the audio 
signal. In this case, since no specific signals need to be 
inputted as noise, the configuration of equipment can 
be simplified. 

[0021] It is noted that for the watermark embedded 
audio signal to be played back as high quality sound, 
the audio player is to be provided with removing means 
for removing the watermark of a signal level perceivable 
by the human sense of hearing that is embedded in the 
audio signal. The watermark is removed using a specific 
key. 

[0022] In another configuration, the system is provided with a
band separator for separating the audio signal into a plurality
of frequency band signals, each having a specific frequency
band, embedding means for embedding a key as a watermark in a
specific frequency band signal of the plurality of frequency
band signals, and a high quality sound part encryptor for
encrypting a frequency band signal, of said plurality of
frequency band signals, other than the one in which said
watermark is embedded.

[0023] In this configuration, since a specific frequency band
alone can be encrypted, the system is so arranged that only the
low quality sound part can be played back as it is, while the
audio signals including the high sound quality part can be
played back by obtaining a specific key.
[0024] In another configuration, there are provided a scalable
compressor for separating the audio signal into a basic part and
an enhanced part using scalable compression, and an enhanced
part encryptor for encrypting the enhanced part using a specific
key.

[0025] In still another configuration, the system is pro- 
vided with a noise parameter generator for generating 
a noise parameter to produce noise, a noise generator 
for generating noise signals on the basis of the gener- 
ated noise parameter, amplifier for amplifying the noise 
signal to a signal level audible to the human sense of 
hearing, a first adder for adding the noise signal to the 
audio signal, a watermark signal generator for generat- 
ing a watermark signal with the noise parameter, and a 
second adder for adding the watermark signal to the au- 
dio signal to which the noise signal has been added. 
[0026] In the above configuration, an announcement can be used
as the noise signal so that playback does not make the listener
feel unpleasant. Furthermore, the announcement can be utilized
for various purposes, including notifying the listener that the
noise signal can be removed if a fee is paid for the key. In
addition, since the noise parameter is embedded as a watermark,
key management becomes easy if a second key for extracting the
watermark is additionally prepared. That is, even in case a
plurality of audio signals are prepared that contain noises
based on a plurality of noise parameters, only one kind of
second key will serve the purpose.
[0027] Still more configurations are possible. In one of them,
the system is provided with embedding means for embedding music
ID information in the audio signal as a watermark to specify the
audio signal, and an encryptor for encrypting the audio signal
in which the watermark is embedded. In another configuration,
the music ID information contains the number of sample playbacks
that are permitted.

[0028] Because the music ID information is embedded as a
watermark in the above configuration, there is no possibility
that the music ID information will be lost in digital/analog
(D/A) conversion or analog/digital (A/D) conversion, and the
rightful user who has a key exclusive to the audio content can
play back the audio signal as high quality sound. Furthermore,
since the music ID information is embedded as a watermark, it is
difficult to modify with a binary editor or the like, whereby
the copyright protection is further reinforced. In addition,
with the number of sample playbacks embedded as a watermark, the
consumer is allowed to listen to the audio content only a
specific number of times before deciding whether to buy the key
exclusive to the music.

[0029] FIG. 1 is a hardware block diagram showing the outline of
an audio distribution system according to the present invention.

[0030] FIG. 2 is a conceptual diagram of embedding 
and removing a watermark according to the present in- 
vention. 

[0031] FIG. 3 is a hardware block diagram showing 
the outline of an audio signal processor and restoration 
means according to a second embodiment of the 
present invention. 

[0032] FIG. 4 is a schematic diagram showing the out- 
line of the compressor and frequency band separator 
according to the second embodiment of the present in- 
vention. 

[0033] FIG. 5 is a schematic diagram showing the out- 
line of an audio signal processor and restoration means 
according to a third embodiment of the present inven- 
tion. 

[0034] FIG. 6 is a hardware block diagram showing 
an audio signal processor and restoration means of a 
fourth embodiment of the present invention. 
[0035] FIG. 7 is a schematic diagram showing the out- 
line of an audio signal processor and restoration means 
according to a fifth embodiment of the present invention. 
[0036] FIG. 8 is a schematic diagram showing the out- 
line of an audio signal processor and restoration means 
according to a sixth embodiment of the present inven- 
tion. 

[0037] FIG. 9 is a prior art distribution system. 
[0038] FIG. 10 is a first diagram explaining the masking level.

[0039] FIG. 11 is a second diagram explaining the 
masking level. 

[0040] The embodiments of the present invention will 
now be described with reference to the accompanying 
drawings. It is to be understood that the following em- 
bodiments are examples embodying the present inven- 
tion and do not limit the technical scope of the invention. 

Embodiment 1 

[0041] First, the outline of a distribution system according to
the present invention will be explained with reference to FIG.
1. An audio distribution system 122 shown in FIG. 1 is formed of
a distribution apparatus 101, an audio player 111 and a network
110. The distribution apparatus 101 converts audio signals into
audio signal distribution data and is made up of an audio signal
processor 102, a distributor 103 and a storage 107. The audio
signal processor 102 converts inputted audio signals into audio
signal distribution data and sends the data to the distributor
103. The distributor 103 stores the audio signal distribution
data in the storage 107 and, furthermore, sends the audio signal
distribution data via the network 110, for example in accordance
with a send request from the audio player 111. The distributor
103 also sends key information in accordance with a request for
a key from the audio player 111, which will be described later.

[0042] The audio player 111 is formed of a transceiver 112, a
restoration means 121, an audio playback means 119 and a storage
114. In accordance with a request by the consumer, the audio
player 111 requests the distribution apparatus 101 via the
network to distribute specific distribution data, and restores
the distributed distribution data by a method corresponding to
the processing method of the audio signal processor 102. The
audio signal that is restored to a playable form is outputted
acoustically by the audio playback means 119. On request by the
consumer, the transceiver 112 also receives a key corresponding
to the audio signal distribution data. Concrete examples of the
audio player 111 include a portable player, a personal computer
and audio equipment. The term "audio signal" as used herein
means data making up audio contents, such as a popular song,
including sound and voice signals. Needless to say, a sound
and/or voice signal alone is audio content, too. The audio
signal distribution data means audio signals processed so as to
be sent and received via the wired or wireless network 110,
represented by the Internet.

[0043] Next, the procedures in the distribution apparatus 101
and the audio player 111 will be explained in detail with
reference to FIG. 1.

[0044] First, an audio signal is inputted into the distribution
apparatus 101. The way of inputting the audio signal is not
particularly restricted. For example, with a separate audio
player connected to the distribution apparatus 101, the signal
inputted from that reproduction apparatus may be taken as the
audio signal. Examples of such an audio player include a CD
player and a record player. The audio signals are sent in usual
sound data formats such as linear pulse code modulation (PCM)
format and analog format. In case the output of the audio player
is an analog signal, the signal is converted into a digital
signal in advance as necessary.
[0045] The inputted audio signal has a watermark embedded in it
by embedding means 104. Generally, a watermark is information,
such as an ID, which is necessary for control but has nothing to
do with the audio signals. To put it another way, a watermark is
a digital signal that is embedded in the audio signal according
to a specific rule and which can be taken out using a method
corresponding to the specific rule. Here in Embodiment 1, the
watermark is adjusted to a level of sound perceivable (audible)
by the human sense of hearing and embedded using a first key,
which will be described later, whereby the sound and/or voice
outputted when the audio signal is played back can be degraded.
As used herein, the term "a level of sound perceivable by the
human sense of hearing" means, for example, a signal level
higher than the masking level of the audio signal, which changes
from moment to moment. "Masking" is the phenomenon in which,
when a person hears one sound, called the "masker," another
sound close to it in frequency, called the "maskee," ceases to
be perceived. The threshold between the sound pressure level at
which the maskee can be perceived and the sound pressure level
at which it cannot is called the masking level. FIG. 10 is an
example with sine waves, in which the sine wave with frequency
f0 is the masker and the dotted line indicates the masking
level. In this case, the sine wave with frequency f1 is below
the masking level and is not perceived by the human sense of
hearing, while the sine wave with frequency f2 has a sound
pressure above the masking level and can be perceived by the
human sense of hearing. FIG. 11 is an example of an audio signal
1101 at a point of time. Here, if a signal 1103 whose sound
pressure exceeds the masking level 1102 in at least part of the
band is added to the audio signal 1101, the component of the
added signal that exceeds the masking level 1102 is perceived by
the sense of hearing. While the masking level value depends on
the audio signal, it is reported in a study by Egan et al. ("On
the Masking Pattern of a Simple Auditory Stimulus," J. Acoust.
Soc. Am. 22, 622-630, 1950) that if, for example, a band noise
at 80 dB centered around 400 Hz is the masker, the masking level
at 400 Hz is about 60 dB. In the case of an audio signal, the
masking level changes from moment to moment depending on the
characteristics of the audio signal. If, therefore, a watermark
signal with a sound pressure level higher than this changing
masking level is added to the audio signal, it is possible to
embed a watermark with a signal level that can be perceived by
the human sense of hearing. The embedding of the watermark by
the embedding means 104 will be explained in detail later.
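
The audibility condition described above, namely that a
component is perceived only where its level exceeds the masking
level, can be written as a simple per-band comparison. The
following is a minimal sketch under stated assumptions: the
function name audible_bands and the representation of levels as
per-band values in dB are hypothetical, and how the masking level
itself is estimated from the audio signal is outside its scope.

```python
def audible_bands(watermark_db, masking_db):
    """Return the indices of bands where the watermark exceeds the masking level.

    watermark_db: watermark sound pressure level per band, in dB (assumed input).
    masking_db:   masking level per band, in dB, for the same bands (assumed input).
    """
    return [i for i, (w, m) in enumerate(zip(watermark_db, masking_db)) if w > m]


# Two components measured against a 60 dB masking level, the figure
# quoted in the text for an 80 dB band noise masker around 400 Hz.
print(audible_bands([55.0, 65.0], [60.0, 60.0]))  # prints [1]
```

Only the 65 dB component lies above the masking level, so only
that band would contribute audible degradation.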

[0046] The audio signal with a watermark embedded by the
embedding means 104 is then compressed into the format of the
audio signal distribution data by compressor 105. The
compression format of the audio signal distribution data is MP3,
AAC or the like. The audio signal does not always have to be
compressed, however; although the data size will be large
without compression, linear PCM as it is will do, too. That is,
the audio signal processor 102 embeds a watermark in an inputted
audio signal and, as necessary, compresses it.
[0047] Then, the compressed audio signal with the watermark
embedded therein is transmitted to the distributor 103. On
receiving it, the distributor 103 adds to the audio signal the
address information of the distribution apparatus 101 and
identification information to specify this audio signal, and
stores the audio signal in the storage 107 as audio signal
distribution data. It is noted that the distributor 103 can
store many kinds of audio signal distribution data in the
storage 107, depending on the memory capacity of the storage
107. The audio signal distribution data thus stored are sent to
the audio player 111 via the network 110, for example on request
by the audio player 111.



[0048] When received by the transceiver 112 of the audio player
111, the audio signal distribution data thus sent is first
stored in the storage 114. Then, decompressor 113 in the
restoration means 121 reads out the audio signal distribution
data stored in the storage 114 and decompresses it by a method
matched with the compression carried out by the compressor 105.
Audio signal distribution data which has not been compressed by
the compressor 105 is not decompressed, either. For sample
playback only or the like, the audio signal distribution data
may be transmitted directly from the transceiver 112 to the
decompressor 113 without being stored in the storage 114.

[0049] The decompressed audio signal distribution data is
transmitted to the removing means 115 in the restoration means
121, and the watermark embedded by the embedding means 104 is
removed. The removal of the watermark will be explained in
detail later.
[0050] With the watermark removed, the audio signal distribution
data is transmitted to the audio playback means 119 and is
outputted acoustically by the audio playback means 119.

[0051] There will be described the procedures at em- 
bedding means and removing means with reference to 
FIG. 1 and FIG. 2. 

[0052] In the embedding means 104, an inputted audio signal is
transmitted to adder 108 and watermark signal generator 106. An
example of the audio signal transmitted to the adder 108 and the
watermark signal generator 106 is shown as an audio signal 201.
Receiving the audio signal 201, the watermark signal generator
106 generates a watermark signal 202 on the basis of the audio
signal 201 and the first key 109 stored in advance in storage
107.

[0053] The watermark signal 202 generated here is a 
level of signal (noise) audible to the human sense of 
hearing when reproduced. In other words, a watermark 
has to have merely a level of signal audible to the human 
sense of hearing. There is no need to input in the wa- 
termark signal generator 106 a significant digital signal 
such as, for example, control information like the prior 
art watermark. 

[0054] Then, the watermark signal 202 is handed over from the
watermark signal generator 106 to the adder 108. The adder 108
adds the inputted audio signal 201 and the watermark signal 202
to generate a watermark embedded audio signal 203. Here, the
watermark embedded audio signal 203 is almost identical with the
audio signal 201 in waveform but slightly different because the
watermark signal 202 is added. Since the watermark signal 202
has been added to the watermark embedded audio signal 203, the
watermark signal 202 (noise) is played back together with the
audio signal. In other words, the watermark embedded audio
signal 203 is the audio signal 201 with noise added.
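
The embedding step (watermark signal generator 106 followed by
adder 108) can be sketched as follows. This is a simplified
illustration, not the generator of the embodiment: here the
watermark is pseudo-random noise derived only from a key used as
a seed and from the signal length, whereas the text has the
generator also take the audio signal 201 into account. The names
keyed_noise and embed_watermark, the fixed amplitude, and the use
of Python's random module are all assumptions.

```python
import random


def keyed_noise(num_samples, key, amplitude=0.1):
    """Stand-in for generator 106: key-seeded noise in [-amplitude, amplitude]."""
    rng = random.Random(key)  # the key acts as the seed (assumption)
    return [amplitude * rng.uniform(-1.0, 1.0) for _ in range(num_samples)]


def embed_watermark(audio, key, amplitude=0.1):
    """Stand-in for adder 108: add the watermark signal sample by sample."""
    watermark = keyed_noise(len(audio), key, amplitude)
    return [a + w for a, w in zip(audio, watermark)]


audio = [0.0, 0.25, 0.5, 0.25, 0.0, -0.25, -0.5, -0.25]
marked = embed_watermark(audio, key=1234)  # audio with audible noise added
```

In a real system the amplitude would have to follow the
moment-to-moment masking level so that the noise stays audible;
the fixed value here only keeps the sketch short.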
[0055] The watermark embedded audio signal 203 is compressed by
the compressor 105 as necessary, becomes audio signal
distribution data with the address information and
identification information added thereto, is sent to the audio
player 111 via the network 110, and is transmitted to removing
means 115 by way of the decompressor 113 as set forth above.
[0056] Receiving the watermark embedded audio signal 203, the
removing means 115 hands it over to adder 116 and extracting
means 117. Receiving the watermark embedded audio signal 203,
the extracting means 117 searches the storage 114 and obtains a
second key 120 for the watermark embedded audio signal 203.
Then, the extracting means 117 generates a similar watermark
signal using the watermark embedded audio signal 203 and the
second key 120 stored in the storage 114. The way in which the
audio player 111 acquires the second key 120 will be described
later.

[0057] Here, the similar watermark signal has almost the same
waveform as the watermark signal 202. But while the watermark
signal 202 is generated on the basis of the audio signal 201,
the similar watermark signal is produced from the watermark
embedded audio signal 203, which is slightly different from the
audio signal 201. Therefore, the similar watermark signal is
slightly different from the watermark signal 202 in waveform.
[0058] Next, the amplitude of the similar watermark 
signal generated on the basis of the watermark embed- 
ded audio signal 203 is reversed into watermark remov- 
ing signal 204 by reversing means 118. Here, the wa- 
termark removing signal 204 generated by reversing the 
amplitude has an amplitude with a positive or negative 
sign opposite to that of the watermark signal 202. There- 
fore, if the watermark removing signal and the water- 
mark signal 202 are added together, the two signals off- 
set each other to a level of sound not audible to the hu- 
man sense of hearing. 

[0059] The watermark removing signal 204 generated by the
reversing means 118 is then transferred to the adder 116. The
adder 116 acquires a reproduced signal 205 by adding the
watermark embedded audio signal 203 and the watermark removing
signal 204. That is, the reproduced signal 205 is the watermark
embedded audio signal 203 with the watermark signal 202 (noise)
removed from it. Although the removal is not exact, the noise is
reduced to the point where it is not audible to humans, so the
quality of the reproduced sound will be as high as that of the
audio signal inputted into the audio signal processor 102.
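
The removal path (extracting means 117, reversing means 118 and
adder 116) can be sketched in the same simplified way. As an
assumption made only for illustration, the watermark is
key-seeded pseudo-random noise and the same seed serves as both
the first and the second key, so the regenerated signal matches
the embedded one exactly; in the embodiment the similar watermark
signal is derived from the watermark embedded audio signal 203
and is only approximately equal. The names keyed_noise and
remove_watermark are hypothetical.

```python
import math
import random


def keyed_noise(num_samples, key, amplitude=0.1):
    """Key-seeded noise standing in for the watermark signal (assumption)."""
    rng = random.Random(key)
    return [amplitude * rng.uniform(-1.0, 1.0) for _ in range(num_samples)]


def remove_watermark(marked, key, amplitude=0.1):
    """Regenerate the watermark, reverse its sign, and add it back in."""
    similar = keyed_noise(len(marked), key, amplitude)  # extracting means 117
    removing = [-s for s in similar]                    # reversing means 118
    return [m + r for m, r in zip(marked, removing)]    # adder 116


# Build a watermarked test tone, then restore it with the right and wrong key.
audio = [0.5 * math.sin(2 * math.pi * n / 32) for n in range(64)]
marked = [a + w for a, w in zip(audio, keyed_noise(len(audio), key=1234))]
restored = remove_watermark(marked, key=1234)
wrong = remove_watermark(marked, key=9999)
```

With the matching key the residual is at the level of
floating-point rounding, while a wrong key leaves the original
noise in place and adds a second, unrelated noise on top of it.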
[0060] The reproduced signal 205 is transferred to au- 
dio playback means 119. By playing back the repro- 
duced signal 205, the audio playback means 119 per- 
mits the consumer to listen to a reproduced sound of 
high quality. 

[0061] In this connection, in case no second key 120 for the
watermark embedded audio signal 203 is found when the extracting
means 117 searches the storage 114, the extracting means 117
cannot generate a similar watermark signal from the watermark
embedded audio signal 203 and sends no signal to the reversing
means 118. That is, the watermark embedded audio signal 203 is
transferred to the audio playback means 119 without the
watermark signal removed, and the audio playback means 119 plays
back the watermark embedded audio signal 203. In other words,
the consumer listens to a reproduced sound of degraded sound
quality containing noise. But if the second key, which will be
explained later, is purchased, the consumer can listen to
high-quality sound without difficulty.

[0062] The procedure for obtaining the key will be explained in
detail with reference to FIG. 1.
[0063] If the second key 120 for the watermark embedded audio
signal 203 is not found when the extracting means 117 searches
the storage 114, the transceiver 112 lets the consumer know that
a key is needed and urges the consumer to buy the key, for
example by lighting an alarm lamp, indicating this on the
display, or announcing it.

[0064] When the transceiver 112 notifies the consumer that a key
is needed, the consumer can indicate to the audio player 111 an
intention to purchase the key by taking a specific procedure at
the transceiver 112 (for example, pressing a button). When the
specific procedure is completed, the transceiver 112 reads out
the address information and identification information added to
the watermark embedded audio signal 203 by the distributor 103
and establishes a connection with the network 110.
[0065] After having been connected with the network 110, the
transceiver 112 communicates with the distribution apparatus 101
on the basis of its address information stored in the watermark
embedded audio signal 203.

[0066] While communicating with the distribution apparatus 101,
the transceiver 112 sends the identification information it has
read out to the distributor 103 in the distribution apparatus
101.
[0067] Receiving the identification information, the distributor
103 carries out a charging procedure on the basis of the
identification information, and a payment procedure is carried
out between the audio player 111 and the distribution apparatus
101. Here, the payment is made by credit card, in electronic
money, from a bank account or the like, the details of which
will not be explained here.

[0068] If the payment procedure is completed without any
problem, the distributor 103 selects the second key for the
watermark embedded audio signal 203 from the storage 107 on the
basis of the identification information and sends the second key
to the audio player 111. Receiving the second key, the
transceiver 112 stores the key and the applicable identification
information in the storage 114. Through that procedure, the
audio player 111 can get the necessary key.
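
The key acquisition procedure of paragraphs [0063] to [0068] can
be modeled as a small exchange between two objects. This is a
minimal sketch under stated assumptions: the class and method
names are hypothetical, the network and the payment procedure are
reduced to a direct method call and a boolean flag, and the
dictionaries only stand in for storage 107 and storage 114.

```python
class Distributor:
    """Stand-in for distributor 103 with the second keys held in storage 107."""

    def __init__(self, keys_by_id):
        self._keys_by_id = dict(keys_by_id)

    def purchase_key(self, content_id, payment_ok):
        # A key is released only after the payment procedure has succeeded.
        if not payment_ok:
            return None
        return self._keys_by_id.get(content_id)


class Player:
    """Stand-in for audio player 111 with purchased keys held in storage 114."""

    def __init__(self):
        self.key_store = {}

    def find_key(self, content_id):
        # Mirrors extracting means 117 searching storage 114 for a second key.
        return self.key_store.get(content_id)

    def buy_key(self, distributor, content_id, payment_ok=True):
        key = distributor.purchase_key(content_id, payment_ok)
        if key is not None:
            self.key_store[content_id] = key  # stored with its identification info
        return key is not None


distributor = Distributor({"song-001": 1234})
player = Player()
before = player.find_key("song-001")          # no key yet: sample playback only
bought = player.buy_key(distributor, "song-001")
after = player.find_key("song-001")           # key now available for removal
```

Keying the store by identification information matches the text,
in which the transceiver stores the second key together with the
applicable identification information.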

[0069] As set forth above, a watermark (noise) of a level
audible to the human sense of hearing is added to the audio
signal so that the audio contents (audio signal) can be used for
sample playback. Therefore, this technique can be applied not
only to compressed audio contents but also to uncompressed audio
contents. This means that, unlike the second prior art, the
present technique also permits the sending of audio contents
that retain high quality sound without compression and can still
be listened to as a sample. Furthermore, because a watermark is
used, even if the audio signal is compressed or expanded along
the time axis, for example in slightly fast-forwarded playback,
the degree of the compression or expansion can be detected and
the watermark can be removed. In this respect, the audio signals
are easier to handle than when noise alone is added.
[0070] Furthermore, because the same data can be used for the
audio signal and the sample-playback audio signal, it is not
necessary to prepare separate data for sample playback. Even
if one whole piece of music is made audible for sample
playback, it will not impact the data storage capacity of the
storage medium (storage 107 in Embodiment 1).

[0071] The consumer can also store a received audio signal
with a watermark in the storage medium (storage 114 in
Embodiment 1). The consumer can therefore freely pass the
stored watermarked audio signal to others, and such free
passing of audio signal distribution data among consumers
helps promote the sale of audio contents. In addition, because
the consumer cannot remove the watermark information unless a
fee is paid for the second key, the contents provider can
protect the copyright of high sound quality contents and can
charge a fee as necessary.

[0072] It is noted that since prior art watermark techniques
can be applied, the technique according to the present
invention is easy to adopt. Furthermore, in Embodiment 1,
unlike the prior art, the present technique does not embed
significant information (information that is extracted and
further utilized). In generating a watermark (by the watermark
signal generator 106), therefore, only the key and the audio
signal are inputted. There is no need to input significant
information. That makes it easy to prepare a watermark. In
addition, since different watermarks are generated for
different audio signals, the copyright can be protected more
strictly. In case the watermark does not have to be encrypted,
the key does not have to be inputted, either.
[0073] In Embodiment 1, the audio signal processor 102 may be
provided as an independent apparatus (that is, an audio signal
processing apparatus). In this case, a storage other than the
storage 107 is provided within the audio signal processing
apparatus and the first key 109 is stored in that storage,
whereby the same function can be provided. Providing the audio
signal processing apparatus as an independent apparatus in
this way makes it possible to send an audio signal with a
watermark using existing signal distribution means.
[0074] Furthermore, the embedding means 104 may be provided as
an independent apparatus (that is, an embedding means). In
this case, the embedding means 104 and the compressor 105
become independent of each other. That permits free selection
of the techniques used for embedding and removing the
watermark and for compression and decompression when an audio
distribution system is constructed. This is especially useful
when the audio signal supplier and the contents provider who
distributes audio signal distribution data are different. In
other words, the supplier of audio signals supplies
watermarked audio signals alone, while the respective contents
providers themselves perform compression by their respective
methods, thus saving the audio signal supplier labor.

[0075] In the configuration of the audio signal processor 102
in the distribution apparatus 101, a watermark is embedded in
the inputted audio signal by the embedding means 104, and then
the audio signal with the watermark is compressed by the
compressor 105. Alternatively, the compressor 105 may be
provided before the embedding means 104. In this case, the
audio signal is first compressed by the compressor 105, and a
watermark is then embedded by the embedding means 104. If the
compressor 105 is provided before the embedding means 104, the
amount of computation by the embedding means 104 can be
reduced.
[0076] In the above case, the audio signal can be reproduced
by providing the decompressor 113 after the removing means 115
on the distribution apparatus side. In this case, however, the
reversing means 118 is not always necessary.

Embodiment 2



[0077] The outline of an electronic audio distribution system
according to Embodiment 2 of the present invention will be
explained with reference to FIG. 1, FIG. 3 and FIG. 4. The
electronic audio distribution system in Embodiment 2 is almost
identical in configuration with that of Embodiment 1. Only the
points where Embodiment 2 differs from Embodiment 1 will be
explained. The audio signal processor 102 in Embodiment 2 is
configured as shown in FIG. 3 (a). That is, the audio signal
processor 102 is made up of a compressor 301, an embedding
means 302, and a high quality sound part encryptor 303. As
shown in FIG. 4 (a), furthermore, the compressor 301 is
provided with a frequency band separator 401 and an encoder
402.

[0078] The audio signal inputted in the distribution apparatus
101 is first received by the frequency band separator 401 in
the compressor 301. The received audio signal is separated by
a band separation filter into a plurality of frequency bands
403 as shown in FIG. 4 (b) and sent to the encoder 402.

[0079] Here, with regard to the audio signal separated into
bands as above, a basic part 404 and a high quality sound part
405 are defined as shown in FIG. 4 (b). The basic part 404 is
the telephone voice band (300 Hz to 3.4 kHz) and indicates the
minimum frequency band required when audio contents are
reproduced. Meanwhile, the high quality sound part 405
indicates a high frequency band that gives grace to the sound
quality and a low frequency band that gives the sound heavy
low-end power. For example, those are the bands not lower than
3.4 kHz and not higher than 300 Hz.
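
The division into a basic part and a high quality sound part can be illustrated with a short sketch. It assumes, purely for illustration, that each separated band is described by its lower and upper edge in Hz and that the band edges line up with 300 Hz and 3.4 kHz. The function names are hypothetical and the band separation filter itself is not modelled.

```python
BASIC_LOW_HZ = 300.0    # lower edge of the telephone voice band
BASIC_HIGH_HZ = 3400.0  # upper edge of the telephone voice band


def classify_band(low_hz, high_hz):
    """Label a band 'basic' if it lies inside 300 Hz to 3.4 kHz."""
    if low_hz >= BASIC_LOW_HZ and high_hz <= BASIC_HIGH_HZ:
        return "basic"
    return "high_quality"


def split_parts(bands):
    """Split (low_hz, high_hz) bands into basic and high quality parts."""
    basic = [b for b in bands if classify_band(*b) == "basic"]
    high_quality = [b for b in bands if classify_band(*b) != "basic"]
    return basic, high_quality
```

With bands at 0 to 300 Hz, 300 Hz to 1 kHz, 1 to 3.4 kHz and 3.4 to 8 kHz, the two middle bands form the basic part and the outer two form the high quality sound part.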

[0080] Then, the encoder 402 encodes the audio signal
separated into the plurality of frequency bands 403, and the
encoded signal is sent to the embedding means 302. Huffman
coding is one example of the encoding. The encoding is not
always required, however.
[0081] Using a first key 304 read from the storage 107 shown
in FIG. 1, the embedding means 302 embeds the third key 305 as
watermark information in the basic part 404 of the audio
signal that has been separated into the plurality of frequency
bands and encoded. Here, the third key 305 corresponds to a
fourth key which will be described later. It is noted that
although the third key 305 is embedded as watermark
information, the watermark information does not have to be at
a level of sound audible to the human sense of hearing as in
Embodiment 1. Furthermore, the third key 305 is also read out
from the storage 107. Then, the embedding means 302 sends the
audio signal with the watermark signal embedded therein to the
high quality sound part encryptor 303.
[0082] Then, the high quality sound part encryptor 303
encrypts the encoded high quality sound part 405 using the
fourth key 311. In encrypting the high quality sound part 405,
it is conceivable, for example, that the whole code string of
the high quality sound part 405 is encrypted, or that only
several bits on the least significant bit (LSB) side are
encrypted. The LSB side means the part of the code string
forming the high quality sound part 405 that has the least
effect on the sound quality. Here, it is possible to control
the sound quality of the high quality sound part 405 by
adjusting the number of bits to be encrypted.
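
As a rough illustration of encrypting only several bits on the LSB side, the sketch below XORs the lowest `n_bits` of each code word with a key-dependent bit stream. Everything here is an assumption made for illustration: the text does not specify a cipher, the function name is hypothetical, and Python's `random.Random` is used only as a reproducible toy keystream that is not cryptographically secure.

```python
import random


def crypt_lsbs(codes, key, n_bits):
    """XOR the n_bits least significant bits of each code word.

    Applying the function twice with the same key restores the
    original code words, because XOR is its own inverse.
    """
    if n_bits < 1:
        raise ValueError("n_bits must be at least 1")
    rng = random.Random(key)  # toy keystream, NOT secure
    # getrandbits(n_bits) is below 2**n_bits, so the upper bits of
    # each code word are left untouched.
    return [c ^ rng.getrandbits(n_bits) for c in codes]
```

Raising `n_bits` scrambles more of each code word and so degrades the unlicensed playback further, which corresponds to the quality control by the number of encrypted bits mentioned in the text.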

[0083] In the next step, the high quality sound part encryptor
303 sends to the distributor 103 the basic part 404 with the
watermark signal embedded therein and the encrypted high
quality sound part 405 as a single compressed, encrypted audio
signal.
[0084] The process is the same as in Embodiment 1 up to the
point where the compressed, encrypted audio signal is sent to
the distributor 103 and then received by the restoration means
121 via the transceiver 112 in the audio player 111. But
Embodiment 2 is different from Embodiment 1 in that the
restoration means 121 in Embodiment 2 is configured as in
FIG. 3 (b). In other words, the restoration means 121 is
composed of an extracting means 306, a high quality sound part
decoder 307, a synthesizing means 308, and a decompressor 309.
[0085] Using the second key 310 read out from the storage 114
shown in FIG. 1, the extracting means 306 reads out the third
key 305 embedded in the basic part 404 of the compressed,
encrypted audio signal received from the distribution
apparatus 101. The third key 305 thus read out is sent to the
high quality sound part decoder 307. The way of acquiring the
second key 310 is the same as that in Embodiment 1.



[0086] Then, using the third key 305 read out by the
extracting means 306, the high quality sound part decoder 307
decodes (decrypts) the encrypted high quality sound part 405
of the compressed, encrypted audio signal, which it also
receives from the distribution apparatus 101, and sends the
decoded high quality sound part 405 to the synthesizing means
308.

[0087] The synthesizing means 308 synthesizes the basic part
404 of the compressed, encrypted audio signal received from
the distribution apparatus 101 and the decoded high quality
sound part 405 received from the high quality sound part
decoder 307, and sends the synthesized signal to the
decompressor 309 as a compressed audio signal. The
decompressor 309 decompresses the compressed audio signal by a
method corresponding to that of the compressor 301 and outputs
the result. After that, the audio signal outputted from the
decompressor 309 is played back acoustically by the audio
playback means 119 in the same way as in Embodiment 1.

[0088] It is noted that the third key 305 stored in the basic
part 404 is read out using the second key 310. In case the
restoration means 121 does not have the second key 310, the
third key 305 cannot be read out, and the high quality sound
part decoder 307 cannot decode the encrypted high quality
sound part. Therefore, since the decompressor 309 cannot
decompress the high quality sound part, the user cannot play
back the audio contents in high sound quality. But the basic
part 404 is not encrypted, and the audio contents can be
listened to, though the sound is of a low quality.
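
The two playback outcomes of Embodiment 2 can be sketched as below. This is a toy model, not the actual watermarking or encryption: it assumes, for illustration only, that the embedded watermark value is the third key XORed with the first key, that the second key equals the first key, and that the high quality sound part is encrypted by XOR with a key equal to the third key. All function names are hypothetical.

```python
def embed_third_key(third_key, first_key):
    # Toy stand-in for the embedding means 302: the value carried
    # in the basic part hides the third key behind the first key.
    return third_key ^ first_key


def restore(basic_part, watermark, encrypted_hq, second_key):
    """Return (basic_part, hq_part); hq_part is None without a key."""
    if second_key is None:
        # No second key: the third key cannot be read out, so only
        # the unencrypted basic part is available (sample playback).
        return basic_part, None
    third_key = watermark ^ second_key          # extracting means 306
    hq = [c ^ third_key for c in encrypted_hq]  # decoder 307
    return basic_part, hq
```

In this model a player without the second key still returns the basic part, which matches the low quality sample playback described in the text.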

[0089] Thus, the user can listen to a sample of the audio
contents sufficiently before deciding whether to buy the
second key. The contents provider can prevent illegal use and
illegal copying of high sound quality audio contents. While
protecting the copyright, the contents provider can urge the
user to buy the key.

[0090] In Embodiment 2, the high quality sound part 405 is
encrypted and the third key is embedded in the basic part 404.
Alternatively, the basic part 404 may be encrypted and the
third key may be embedded in the high quality sound part 405.
In this case, needless to say, the high quality sound part
alone can be reproduced for sample playback, and as a result,
reproduction will be of low sound quality. In the present
embodiment, furthermore, the third key is first extracted
using the second key, and the encrypted part is then decoded.
Alternatively, the encrypted part may be directly decoded
using the second key. In this embodiment, however, the third
key has to be taken out first, and so the security can be said
to be strict. In particular, in case a consumer acquires an
illegal copy of an audio signal by a means that degrades the
sound quality greatly, high sound quality reproduction is
impossible because the third key cannot be taken out.

Embodiment 3

[0091] The outline of the audio distribution system according
to a third embodiment of the present invention will be
explained with reference to FIG. 1 and FIG. 5. The audio
distribution system of Embodiment 3 is almost identical with
that in Embodiment 2 in configuration, and only what is
different between the two embodiments will be explained. The
audio signal processor 102 in Embodiment 3 is configured as in
FIG. 5 (a). That is, the audio signal processor 102 has a
scalable compressor 501, an embedding means 302 and an
enhanced part encryptor 502. The embedding means 302 is the
same as that in Embodiment 2.

[0092] An audio signal inputted in the audio signal processor
102 is received by the scalable compressor 501. The received
audio signal is compressed by the scalable compressor 501 and
transmitted to the embedding means 302.
[0093] Here, scalable compression does not separate the audio
signal per band, as the band separation by the compressor 301
in Embodiment 2 does, but separates the audio signal into a
basic part (main stream) and an enhanced part (extension
stream). It is a compression method provided for in MPEG
(Moving Picture Experts Group). In other words, in case of
scalable compression in Embodiment 3, the basic part and the
enhanced part can each contain the whole band. That is where
Embodiment 3 is different from Embodiment 2.

[0094] As in Embodiment 2, the embedding means 302 embeds the
third key 504 as watermark information in the basic part of
the scalable compressed audio signal using the first key 503
read out from the storage 107 shown in FIG. 1. Here, too, the
watermark information does not have to be at a sound level
audible to the human sense of hearing as in Embodiment 1.
Then, the embedding means 302 sends the audio signal with the
watermark embedded therein to the enhanced part encryptor 502.

[0095] Then, using the fourth key 505 read out from the
storage 107, the enhanced part encryptor 502 encrypts the
enhanced part prepared by the scalable compressor 501, and
sends it together with the basic part to the distributor 103.

[0096] The scalable compressed, encrypted audio signal is sent
to the distributor 103, and then received by the restoration
means 121 in the audio player 111 via the transceiver 112. The
process up to that point is the same as in Embodiments 1 and
2. However, the restoration means 121 in Embodiment 3 is
configured as in FIG. 5 (b), where the present embodiment is
different from Embodiments 1 and 2. In other words, an
enhanced part decoder 507 is provided in place of the high
quality sound part decoder 307 used in Embodiment 2, and
another difference is that the decompressor 309 is a
decompressor for scalable compression in the present
embodiment.

[0097] The enhanced part decoder 507 decodes (decrypts) the
encrypted enhanced part of the scalable compressed, encrypted
audio signal, which it also receives from the distribution
apparatus 101, using the third key 504 read out with the
second key 506 at the extracting means, and sends the decoded
enhanced part to the synthesizing means 308. The third key 504
is key information for the fourth key 505.

[0098] The synthesizing means 308 synthesizes the basic part
of the scalable compressed, encrypted audio signal received
from the distribution apparatus 101 and the decoded enhanced
part received from the enhanced part decoder 507, and sends
the result to the decompressor 309 as a scalable compressed
audio signal. The decompressor 309 decompresses the scalable
compressed audio signal by a method corresponding to that of
the scalable compressor 501. After that, the audio signal
outputted from the decompressor 309 is reproduced acoustically
by the audio playback means 119 as in Embodiments 1 and 2.

[0099] Using the second key 506, the third key 504 is
extracted and the enhanced part is decoded. But in case the
restoration means 121 does not have the second key 506, the
third key 504 cannot be extracted, and therefore the enhanced
part decoder 507 cannot decode the encrypted enhanced part.
Therefore, the decompressor 309 cannot decompress the enhanced
part, and the user cannot play back the audio content in high
sound quality. But the basic part is not encrypted, and the
audio content can be played back, though poor in sound
quality.

[0100] Thus, the user can sufficiently play back a sample of
the audio contents before deciding whether to buy the second
key. The contents provider can prevent the audio contents from
being illegally used or illegally copied and thus can protect
the copyright strictly. At the same time, the contents
provider can urge the user to pay a fee and buy the key.

[0101] Even in a general-purpose audio player other than the
audio player having the restoration means 121, the scalable
compressed and encrypted audio signal can be played back,
though poor in sound quality.
[0102] In this connection, in case audio contents (au- 
dio signal) that are not so high in sound quality in them- 
selves are scalable compressed and if the enhanced 
part is encrypted, it is conceivable that the sound quality 
does not degrade so much and the sound quality that is 
not much different from the audio signal before encryp- 
tion can be played back without the user's buying the 
second key. In such a case, the enhanced part encryptor 
502 may be made a basic part encryptor for encrypting 
the basic part and the enhanced part decoder 507 may 
be made a basic part decoder for decoding the basic 
part. In this case, the third key is embedded in the en- 
hanced part. 

Embodiment 4 

[0103] Next, the outline of an audio distribution system
according to Embodiment 4 of the present invention will be
explained with reference to FIG. 1 and FIG. 6. The audio
distribution system according to Embodiment 4 is almost
identical with that of Embodiment 1 in configuration, and only
the points where the present embodiment differs from
Embodiment 1 will be described. Here in Embodiment 4, the
audio signal processor 102 is configured as shown in FIG. 6
(a). That is, the audio signal processor 102 is made up of a
noise parameter generator 601, a noise generator 602, an
amplifier 603, a first adder 604, a watermark signal generator
605 and a second adder 606. But the noise parameter generator
601 does not always have to be within the audio signal
processor 102. The noise parameter may be inputted separately
from outside.

[0104] First, when an audio signal is inputted in the first
adder 604 in the audio signal processor 102, the noise
parameter from the noise parameter generator 601 is inputted
in the noise generator 602 and the watermark signal generator
605. The noise generator 602 generates a noise signal on the
basis of the noise parameter. Here, the noise parameter is,
for example, a reference value or the like used to produce the
noise signal. Any noise generator 602 that produces noise on
the basis of this reference value will serve the purpose. The
noise parameter may also be an index that selects a noise
signal pattern prepared in advance in the noise generator 602.
Furthermore, the noise signal may be a sound that makes the
listener feel unpleasant. It may also be an announcement like
"This music is for sample playback."

[0105] The noise signal generated by the noise generator 602
is amplified by a specific factor by the amplifier 603 and
sent to the first adder 604. The first adder 604 adds the
noise signal to the audio signal and sends the result to the
second adder 606 as an audio signal of a low sound quality
containing noise.

[0106] The watermark signal generator 605, which has received
the noise parameter, prepares a watermark signal of the noise
parameter using the first key read out from the storage 107.
In this case, since it is necessary to prepare the watermark
signal on the basis of the audio signal of the low sound
quality, the audio signal of the low sound quality prepared by
the first adder 604 is also inputted in the watermark signal
generator 605. Alternatively, the audio signal of a high sound
quality before it is processed by the first adder 604 may be
inputted.
[0107] The noise parameter watermark signal prepared by the
watermark signal generator 605 is sent to the second adder 606
and embedded as a watermark signal in the audio signal of a
low sound quality sent from the first adder 604.

[0108] The audio signal of the low sound quality with the
watermark signal embedded therein is sent to the distributor
103 as a noise-mixed audio signal.
[0109] After having been sent to the distributor 103, the
noise-mixed audio signal is processed the same way as in
Embodiments 1, 2 and 3 up to the point where the signal is
received by the restoration means 121 via the transceiver 112
in the audio player 111. However, the restoration means 121 is
configured as in FIG. 6 (b), where Embodiment 4 is different
from Embodiments 1, 2 and 3. That is, the restoration means
121 is formed of an extracting means 608, a noise generator
609, an amplifier 610 and an adder 611.

[0110] When the noise-mixed audio signal is inputted, the
extracting means 608 extracts the watermark signal, that is,
the noise parameter, from the noise-mixed audio signal using
the second key 612 read out from the storage 114 shown in
FIG. 1 and sends it to the noise generator 609. On the basis
of the extracted noise parameter, the noise generator 609
produces the same noise signal as the noise signal produced by
the noise generator 602 and sends it to the amplifier 610.
[0111] At the amplifier 610, the noise signal is amplified by
the same specific factor as at the amplifier 603. Furthermore,
with the amplitude reversed, the noise signal is outputted to
the adder 611.

[0112] The adder 611 adds the noise-mixed audio signal and the
noise signal that has been amplified by the specific factor
with its amplitude reversed, and thus can remove the noise
signal contained in the noise-mixed audio signal. Then, the
audio signal outputted from the adder 611 is reproduced by the
audio playback means 119 in the same way as in Embodiments 1,
2 and 3. The method of acquiring the second key 612 is the
same as that in Embodiments 1, 2 and 3. It goes without saying
that the user cannot remove the noise signal unless a fee is
paid for the second key.
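
The add-then-cancel principle of Embodiment 4 can be sketched as follows. This is a simplified illustration under stated assumptions: the noise parameter is treated as a seed for a reproducible pseudo-random generator, samples are integers so that cancellation is exact, and the watermark embedding and extraction of the noise parameter (which require the first and second keys) are left out. The function names and the noise amplitude range are hypothetical.

```python
import random


def generate_noise(noise_parameter, length):
    """Reproduce the same noise signal from the same noise parameter."""
    rng = random.Random(noise_parameter)
    return [rng.randint(-100, 100) for _ in range(length)]


def add_noise(audio, noise_parameter, gain):
    """Distribution side: noise generator, amplifier and first adder."""
    noise = generate_noise(noise_parameter, len(audio))
    return [a + gain * n for a, n in zip(audio, noise)]


def remove_noise(noisy, noise_parameter, gain):
    """Player side: regenerate, amplify, reverse amplitude and add."""
    noise = generate_noise(noise_parameter, len(noisy))
    inverted = [-(gain * n) for n in noise]
    return [x + i for x, i in zip(noisy, inverted)]
```

Removal succeeds only when the player regenerates the identical noise with the identical gain, which is why the noise parameter has to be recovered from the watermark first.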

[0113] As set forth above, the user can sufficiently play back
a sample of the audio content before deciding whether to buy
the second key. The contents provider can prevent the audio
contents of high sound quality from being illegally utilized
or copied, thus protecting the copyright strictly and urging
the user to pay a fee and buy the key.

[0114] An announcement can be used as the noise signal, and it
will not make the user feel unpleasant when the noise signal
is reproduced. An announcement can be used in many ways.
Through the announcement, for example, the contents provider
can inform the user that the noise signal can be removed if
the key is purchased.

[0115] Furthermore, if the noise parameter itself were the
second key and there were a plurality of noise parameters, a
plurality of noise-mixed audio signals would be prepared, and
those different noise-mixed audio signals would need different
keys (noise parameters). In Embodiment 4, however, the noise
parameter is embedded as a watermark, and a second key to
extract the watermark is prepared separately, whereby key
information can be controlled with ease using one kind of
second key alone even if a plurality of noise-mixed audio
signals are prepared with a plurality of noise parameters.

Embodiment 5 

[0116] The outline of an audio distribution system according
to Embodiment 5 of the present invention will be explained
with reference to FIG. 1 and FIG. 7. The audio distribution
system in Embodiment 5 is almost identical with that in
Embodiment 1 in configuration. Only the points where the
present embodiment differs from Embodiment 1 will be
explained. The audio signal processor 102 in Embodiment 5 is
configured as shown in FIG. 7 (a). That is, the audio signal
processor 102 is formed of an embedding means 701 and an
encryptor 702.

[0117] First, an audio signal is inputted in the embedding
means 701 in the audio signal processor 102. Then music ID
information read out from the storage 107 is embedded. The
music ID information may be embedded as follows: as in
Embodiment 1, a watermark signal is generated and added to the
audio signal. Here, the music ID information is a unique ID
for the audio signal (that is, the music), and with the ID
information, it is possible to specify the audio signal.

[0118] Then, the audio signal with the music ID information
embedded therein is transmitted to the encryptor 702,
encrypted and sent to the distributor 103. Here in the
distributor 103, the address information of the distribution
apparatus 101 is added to the encrypted audio signal in which
the music ID information is embedded, but identification
information to specify the audio signal is not added, which is
where Embodiment 5 is different from Embodiment 1. That is
because the music ID information has already been embedded and
there is no need for it.

[0119] After having been sent to the distributor 103, the
encrypted audio signal, to which the address information is
added, is received by the restoration means 121 via the
transceiver 112 in the audio player 111. The process up to
that point is the same as in Embodiments 1 to 4. But the
restoration means 121 in Embodiment 5 is configured as shown
in FIG. 7 (b), where Embodiment 5 is different from
Embodiments 1 to 4. In other words, the restoration means 121
is made up of a decoder 704, an extracting means 705, a noise
generator 706, a switch 707 and an adder 708.

[0120] First, the encrypted audio signal is inputted in the
restoration means 121, and then the encrypted audio signal is
decoded by the decoder 704 into an audio signal with the music
ID information embedded therein. Here, the decoder is designed
to decode an audio signal encrypted in advance by the
encryptor 702. That is, the encrypted audio signal is of a
data type that can be reproduced by the audio player 111 only.
[0121] Then, the audio signal with the music ID information
embedded therein is sent to the extracting means 705 and the
adder 708. Receiving the audio signal with the music ID
information embedded therein, the extracting means 705
extracts the music ID information and sends it to the switch
707.
[0122] The switch 707 reads out the key 709 corresponding to
the music ID information from the storage 114 in FIG. 1 on the
basis of the extracted music ID information.



[0123] Here, in case the key 709 is in the storage 114, the
noise signal produced by the noise generator 706 can be shut
out by turning off the switch 707 connecting the noise
generator 706 and the adder 708.
[0124] Because the noise signal is shut out, the adder 708
sends to the audio playback means 119 the audio signal with
the music ID information embedded therein without adding the
noise signal. The audio playback means 119 can reproduce the
audio signal with the music ID information embedded therein,
that is, the audio signal of a high sound quality. It goes
without saying that the music ID information embedded as
watermark information does not degrade the sound quality.

[0125] In case the key 709 is not in the storage 114, the
noise signal produced by the noise generator 706 is sent to
the adder 708 by turning on the switch 707 connecting the
noise generator 706 and the adder 708. The adder 708 adds the
noise signal to the audio signal with the music ID information
embedded therein and outputs the result to the audio playback
means 119. Therefore, unless the key exclusively for the music
is purchased, the switch 707 cannot be turned off and only
music of a low sound quality with a noise component added
thereto can be reproduced. The procedure for purchasing the
key exclusively for the music is the same as that for
purchasing the second key as described in Embodiments 1 and 2.
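
The switch behaviour of Embodiment 5 can be sketched as follows. This is only an illustration: the function name and the dictionary standing in for the storage 114 are hypothetical, and the decoding and watermark extraction steps that produce `audio` and `music_id` are assumed to have already run.

```python
def playback_signal(audio, music_id, key_storage, noise):
    """Return the samples passed to the audio playback means.

    key_storage maps music ID information to purchased keys and
    plays the role of storage 114 in this sketch.
    """
    # The switch connects the noise generator to the adder and is
    # turned off only when a key for this music ID is stored.
    switch_on = music_id not in key_storage
    if not switch_on:
        return list(audio)  # noise shut out: high sound quality
    return [a + n for a, n in zip(audio, noise)]  # noise added
```

Note the difference from Embodiment 4: here the distributed signal itself is clean, and the noise is added inside the player whenever the key is missing.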
[0126] As set forth above, in the present embodiment, too, the
user can freely play back a sample of the audio contents
before deciding whether to buy the key exclusively for the
music. The contents provider can prevent illegal use or
illegal copying of the audio contents of a high sound quality,
protect the copyright, and urge the consumer to pay a fee and
buy the key.
[0127] If the music ID information is just added to the audio
signal as identification information, the following problem
will arise. If, for example, the audio signal is once D/A
(digital-analog) converted, the identification information
will be eliminated and the music ID information of the audio
signal will be impossible to recognize. The rightful user who
has a key exclusively for the music then cannot reproduce the
audio signal of a high sound quality. In Embodiment 5,
however, since the music ID information is embedded as a
watermark, the music ID information will not be lost by D/A
(digital-analog) conversion or A/D (analog-digital)
conversion. The rightful user who has the key exclusively for
the music can reproduce the audio signal of a high sound
quality.

[0128] Similarly, mere addition of the music ID information to
the audio signal as identification information permits the
editing and erasing of the music ID information using a binary
editor, and it could be feared that the audio signal will be
modified illegally. In Embodiment 5, however, the music ID
information is embedded as a watermark and it is difficult to
modify the music ID information with a binary editor or the
like. That is, the copyright can be protected still more
strictly.

Embodiment 6

[0129] Next, the outline of an audio distribution system
according to Embodiment 6 of the present invention will be
described with reference to FIG. 1, FIG. 7 and FIG. 8. The
audio distribution system in Embodiment 6 is almost identical
with that in Embodiment 5 in configuration, and only the
points where the present embodiment differs from Embodiment 5
will be explained. Here, the audio signal processor 102 in
Embodiment 6 is identical in configuration with that in
Embodiment 5 shown in FIG. 7 (a). Also, the process up to the
following point is the same as that in Embodiment 5. That is,
the music ID information read out from the storage 107 is
embedded in an inputted audio signal, which is encrypted by
the encryptor 702 into an encrypted audio signal and sent to
the distributor 103. Then the audio signal is received by the
restoration means 121 by way of the transceiver 112 in the
audio player 111. But the restoration means 121 in Embodiment
6 is configured as shown in FIG. 8 and is different from that
in Embodiment 5. That is, the restoration means 121 in
Embodiment 6 is formed of a decoder 704, an extracting means
705, a counter 801, a storage 802 and a switch 803. In
addition, the music ID information contains the permissible
number of sample playbacks as data.

[0130] First, the encrypted audio signal is inputted into
the restoration means 121. Then, the encrypted audio
signal is decoded by the decoder 704 into an audio signal
with music ID information embedded therein. Here, the
decoder 704 is designed to decode the audio signal
encrypted by the encryptor 702. That is, the encrypted
audio signal is of a data type that can be reproduced
only by the audio player 111 in Embodiment 6.
[0131] Then, the audio signal with the music ID information
embedded therein is sent to the extracting means 705 and
the switch 803. On receiving the audio signal with the
music ID information embedded therein, the extracting
means 705 extracts the music ID information and sends it
to the counter 801.
[0132] The counter 801 reads out a key 709 for the music
ID information from the storage 114 shown in FIG. 1 on the
basis of the extracted music ID information.
[0133] If the key 709 is found in the storage 114, the
counter 801 turns on the switch 803, so that the audio
signal decoded by the decoder 704 is sent to the audio
playback means 119 and can be played back immediately.
As in Embodiment 5, it goes without saying that the music
ID information embedded as a watermark does not degrade
the sound quality.

[0134] If the key 709 is not in the storage 114, the
counter 801 checks, on the basis of the music ID
information, whether the permissible number of sample
playbacks is memorized in the storage 802. If the
permissible number of sample playbacks is not memorized,
this shows that the audio signal with the music ID
information is being played back for the first time. In
that case, the permissible number of sample playbacks in
the music ID information is read out and memorized in the
storage 802 along with the music ID information. Here, the
permissible number of sample playbacks is set for each
piece of music ID information. Here, the number is 5, for
example.

[0135] Then, the counter 801 subtracts one from the
permissible number of sample playbacks memorized in the
storage 802, setting the number at 4, and turns on the
switch 803. If the number of sample playbacks is already
memorized, it is judged whether the number is larger than
0. If so, 1 is subtracted and the switch 803 is turned on.
If the number of sample playbacks is 0, the switch 803 is
turned off. If the switch 803 is turned off, the audio
signal with the music ID information embedded therein is
not sent to the audio playback means 119. That is, the
audio signal cannot be reproduced.
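
The decision made by the counter 801 in paragraphs [0132] to [0135] can be sketched as follows. This is only a minimal illustration, not the implementation of the embodiment: the class name `Counter`, the method `switch_on`, and the use of dictionaries to stand in for the storage 114 and the storage 802 are all assumptions introduced here.

```python
from dataclasses import dataclass, field


@dataclass
class Counter:
    """Toy model of counter 801; every name here is illustrative."""
    keys: dict = field(default_factory=dict)       # stands in for storage 114
    remaining: dict = field(default_factory=dict)  # stands in for storage 802

    def switch_on(self, music_id: str, permitted_plays: int) -> bool:
        """Return True if switch 803 should pass the signal to playback."""
        if music_id in self.keys:
            # Key 709 found: playback is not limited.
            return True
        if music_id not in self.remaining:
            # First playback: memorize the permissible number of sample
            # playbacks carried in the music ID information.
            self.remaining[music_id] = permitted_plays
        if self.remaining[music_id] > 0:
            # Plays remain: subtract one and turn the switch on.
            self.remaining[music_id] -= 1
            return True
        # No plays left: the switch stays off.
        return False


counter = Counter()
results = [counter.switch_on("song-1", 5) for _ in range(6)]
print(results)  # five True values followed by one False
```

With a permissible number of 5, the first five calls turn the switch on and the sixth turns it off. Adding an entry for the same music ID to `keys` makes every later call return True.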

[0136] Therefore, a user who has not bought a key
exclusively for the music can sample-play the audio signal
the number of times memorized in advance in the music ID
information. When the number of playbacks left decreases
to 0, that is, when the audio signal has been listened to
the permissible number of times, the audio signal cannot be
reproduced any more. The key 709 is purchased in the same
way as the second key in Embodiment 1.

[0137] As set forth above, in Embodiment 6, the user can
sample-play the audio signal a specific number of times
before deciding whether to buy the key exclusively for the
music. The contents provider can prevent illegal use or
copying of audio contents of a high sound quality, protect
the copyright and urge the consumer to pay a fee and buy a
key.

[0138] Furthermore, as in Embodiment 5, the music ID
information will not be lost by the D/A conversion of the
audio signal etc., and the rightful owner of the key
exclusively for the music can reproduce the audio signal
with a high sound quality. In addition, the music ID
information is difficult to modify with a binary editor
etc., and thus the copyright is more strictly protected.



Claims 

1. An audio signal processor processing an audio signal
for changing a format distributable through a network,
which comprising:

embedding means for embedding in said audio signal a
watermark of which a signal level is audible to the human
sense of hearing when the audio signal is played back.

2. The audio signal processor according to claim 1,
which further comprising:

a compressor for compressing said watermark embedded
audio signal according to a specific method, said
compressor provided after said embedding means.

3. The audio signal processor according to claim 1,
which further comprising:

a compressor for compressing said watermark embedded
audio signal according to a specific method, said
compressor provided before said embedding means.

4. The audio signal processor according to claim 1,
which comprising:

watermark signal generator for generating a watermark
using said audio signal alone that is inputted into said
watermark signal generator, said generator provided in
said embedding means.

5. An audio player playing back an audio signal dis- 
tributed through a network, which comprising: 

removing means for removing a watermark 
from a watermark embedded audio signal using 
a specific key, said watermark of which a signal 
level is audible to the human sense of hearing. 

6. An audio distribution system including a distribution 
apparatus for distributing an audio signal through a 
network and an audio player for playing back said 
distributed audio signal, 

wherein said distribution apparatus comprises 
embedding means for embedding in said audio 
signal a watermark of which a signal level is au- 
dible to the human sense of hearing when the 
audio signal is played back; and 
wherein said audio player comprises removing 
means for removing a watermark from said wa- 
termark embedded audio signal using a specif- 
ic key. 

7. An audio distribution method wherein a sending side
processes an audio signal for changing the format
distributable through a network and a receiving side plays
back said audio signal, which comprising:

embedding a watermark in said audio signal at the
processing, said watermark of which a signal level is
audible to the human sense of hearing when the audio
signal is played back; and
removing a watermark from said watermark embedded audio
signal using a specific key at the playback.

8. An audio signal processor for processing an audio
signal for changing the format distributable through a
network, which comprising:

a separator for separating said audio signal according to
a specific rule;
embedding means for embedding a key as a watermark in at
least a specific audio signal of said separated audio
signals; and
encryption means for encrypting an audio signal other
than said separated audio signals in which said watermark
is embedded.

9. An audio player playing back an audio signal
distributed through the network, which comprising:

extracting means for extracting a second key embedded as
a watermark from a specific area within said audio signal
using a first key; and
a decoder for decrypting an encrypted area within said
audio signal using said second key extracted by said
extracting means.
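
The arrangement of claims 8 and 9 can be illustrated with a toy sketch. Everything concrete in it is an assumption made for illustration and is not taken from the specification: the "specific rule" is taken to be a split into two halves, the watermark is a least-significant-bit embedding at positions selected by the first key, and the encryption is an insecure XOR keystream. The function names and the 16-bit key size are hypothetical.

```python
import random
from typing import List, Tuple

KEY_BITS = 16  # toy key size (assumption)


def embed_key(part: List[int], second_key: int, first_key: int) -> List[int]:
    """Hide second_key in sample LSBs at positions chosen by first_key."""
    positions = random.Random(first_key).sample(range(len(part)), KEY_BITS)
    out = list(part)
    for i, pos in enumerate(positions):
        bit = (second_key >> i) & 1
        out[pos] = (out[pos] & ~1) | bit
    return out


def extract_key(part: List[int], first_key: int) -> int:
    """Recover the embedded key; requires the same first_key."""
    positions = random.Random(first_key).sample(range(len(part)), KEY_BITS)
    return sum((part[pos] & 1) << i for i, pos in enumerate(positions))


def xor_crypt(part: List[int], key: int) -> List[int]:
    """Toy stream cipher (NOT secure); applying it twice restores the input."""
    rng = random.Random(key)
    return [s ^ rng.getrandbits(16) for s in part]


def process(audio: List[int], first_key: int,
            second_key: int) -> Tuple[List[int], List[int]]:
    """Sending side: separate, watermark one part, encrypt the other."""
    half = len(audio) // 2  # assumed "specific rule": split in half
    marked = embed_key(audio[:half], second_key, first_key)
    encrypted = xor_crypt(audio[half:], second_key)
    return marked, encrypted


def play(marked: List[int], encrypted: List[int], first_key: int) -> List[int]:
    """Receiving side: extract the second key, then decrypt the other part."""
    second_key = extract_key(marked, first_key)
    return marked + xor_crypt(encrypted, second_key)
```

A player holding the first key recovers the second key from the watermarked part and uses it to decrypt the remaining part. In this sketch the watermarked part differs from the original only in the least significant bit of the selected samples, and it must contain at least `KEY_BITS` samples.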

10. An audio signal processor processing an audio sig- 
nal for changing the format distributable through a 
network, which comprising: 

a band separator for separating said audio sig- 
nal into a plurality of frequency band signals 
having a specific frequency band respectively; 
embedding means for embedding a key as a 
watermark in a specific frequency band signal 
having the specific frequency band of said plu- 
rality of frequency band signals; and 
a high quality sound part encryptor for encrypt- 
ing a frequency band signal other than said plu- 
rality of frequency band signals in which said 
watermark is embedded. 

11. An audio player playing back audio signal distributed
through a network, which comprising:

extracting means for extracting a second key embedded as
a watermark from a band signal having a specific frequency
band within said audio signals using a first key; and
a high quality sound part decoder for decrypting an
encrypted frequency band signal having specific frequency
bands within said audio signals using said second key
extracted by said extracting means.

12. An audio signal processor processing an audio signal
for changing the format distributable through a network,
which comprising:

a scalable compressor for separating the audio signal
into a basic part and an enhanced part using the method of
scalable compression;
embedding means for embedding a key as a watermark in
either the basic part or the enhanced part; and
an encryptor for encrypting using a specific key either
the basic part or the enhanced part whichever said
watermark is not embedded in.

13. An audio player for playing back an audio signal
distributed through a network, which comprising:

extracting means for extracting a second key, using a
first key, embedded as a watermark from either said basic
part or said enhanced part within said audio signal which
is compressed by scalable compression and encrypted; and
a decoder for decrypting, using said second key extracted
by said extracting means, either the basic part or the
enhanced part whichever said watermark is not embedded in.

14. An audio distribution system including a distribution
apparatus for distributing an audio signal through a
network, and an audio player playing back said distributed
audio signal,

wherein said distribution apparatus comprises:
a separator for separating said audio signal according to
a specific rule,
embedding means for embedding a first key as a watermark
in a specific audio signal of said separated audio signals;
encryption means for encrypting an audio signal other than
separated audio signals in which said watermark is
embedded; and
wherein said audio player comprises:
extracting means for extracting said first key embedded
as a watermark in said specific signal, using said second
key; and
a decoder for decrypting the audio signal in which
watermark is not embedded, using said first key extracted
from said extracting means.

15. An audio distribution method wherein a sending side
processes an audio signal for changing the format
distributable through a network, and a receiving side plays
back said audio signal, which comprising:

in the processing,
separating the audio signal according to a specific rule;
embedding a first key as watermark in a specific audio
signal of said separated audio signals;
encrypting the audio signal other than said separated
audio signals in which said watermark is embedded; and
in the playing back,
extracting said first key embedded as a watermark in said
specific signal using said second key; and
decrypting the audio signal, in which said watermark is
not embedded, using said extracted first key.

16. An audio signal processor processing an audio signal
for changing to the format distributable through a network,
which comprising:

noise parameter generator for generating a noise
parameter for producing a noise;
noise generator for producing a noise signal on the basis
of the noise parameter generated by said noise parameter
generator;
an amplifier for amplifying said noise signal to a signal
level audible to the human sense of hearing when the signal
is played back;
a first adder for adding to said audio signal said noise
signal amplified by said amplifier;
a watermark signal generator for generating a watermark
signal with the noise parameter as a watermark using a key;
and
a second adder for adding said watermark signal generated
by said watermark signal generator to an audio signal to
which noise signal is added by said first adder.

17. An audio player playing back an audio signal
distributed through a network, which comprising:

watermark signal extracting means for extracting a noise
parameter for producing a noise signal using a specific
key, said noise parameter contained as a watermark in said
audio signal;
a noise generator for generating a noise signal on the
basis of said extracted noise parameter;
an amplifier for amplifying said noise signal a specific
number of times and reversing the amplitude; and
an adder for adding to said audio signal a noise signal
that is amplified a specific number of times and of which
the amplitude is reversed.
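
The noise path of claims 16 and 17 can be sketched as below, assuming integer samples and treating the noise parameter as the seed of a pseudo-random generator. Both are assumptions made here for illustration. The sketch leaves out the step that carries the noise parameter inside the signal as a keyed watermark, and the function names and the noise range are hypothetical.

```python
import random
from typing import List


def make_noise(noise_parameter: int, length: int) -> List[int]:
    """Noise generator: the noise parameter is used as a PRNG seed (assumption)."""
    rng = random.Random(noise_parameter)
    return [rng.randint(-8, 8) for _ in range(length)]


def embed_audible_noise(audio: List[int], noise_parameter: int,
                        gain: int) -> List[int]:
    """Processor side: amplify the noise and add it to the audio (first adder)."""
    noise = make_noise(noise_parameter, len(audio))
    return [a + gain * n for a, n in zip(audio, noise)]


def remove_audible_noise(marked: List[int], noise_parameter: int,
                         gain: int) -> List[int]:
    """Player side: regenerate the noise, amplify it, reverse its amplitude, add."""
    noise = make_noise(noise_parameter, len(marked))
    return [m + (-gain) * n for m, n in zip(marked, noise)]


audio = [0, 1200, -3400, 560, 78]
marked = embed_audible_noise(audio, noise_parameter=42, gain=100)
restored = remove_audible_noise(marked, noise_parameter=42, gain=100)
```

Because the same noise parameter and gain reproduce the same noise sequence, adding the amplitude-reversed noise cancels the embedded noise exactly in integer arithmetic. The sketch ignores clipping to a fixed sample width, which a real signal path would have to handle.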

18. An audio signal processor processing an audio signal
for changing to the format distributable through a network,
which comprising:

watermark embedding means for embedding music ID
information as a watermark in an audio signal, said music
ID information specifying said audio signal; and
encryption means for encrypting said audio signal
embedded with the watermark.

19. An audio player playing back an audio signal
distributed through a network, which comprising:

a decoder for decrypting said encrypted audio signal;
watermark extracting means for extracting music ID
information contained as a watermark in said decrypted
audio signal, said music ID information specifying said
audio signal;
a noise generator for generating noise signal to degrade
said audio signal;
an adder for adding said decrypted audio signal and the
noise signal generated by said noise generator; and
a switch for turning on or off the inputting of said
noise signal to said adder in a result that a specific key
corresponding to said extracted music ID information is
present or not.

20. An audio signal processor processing an audio signal
for changing to the format distributable through a network,
which comprising:

watermark embedding means for embedding as a watermark a
music ID information in said audio signal, said music ID
information including information for specifying said
audio signal and indicating permissible numbers of sample
playback; and
encryption means for encrypting said audio signal
embedded with a watermark.

21. An audio player playing back an audio signal
distributed through a network, which comprising:

a decoder for decrypting said encrypted audio signal;
a watermark extracting means for extracting a music ID
information contained in said decrypted audio signal as a
watermark, said music ID information including information
for specifying said audio signal and indicating
permissible numbers of sample playback;
a storage for storing said extracted music information
associating said information specifying said audio signal
and said information indicating permissible numbers of
sample playback; and
a counter for deciding whether the decrypted audio signal
is played back or not in a result that a specific key
corresponding to the extracted music ID information is
present or not and what the numbers of sample playback
indicates.



FIG. 10: Spectrum of a sine wave and the masking level (axes: sound pressure versus frequency; frequencies F1, F0 and F2 marked).

FIG. 11: Audio signal 1101, masking level 1102, and signal of sound pressure exceeding the masking level (noise perceivable to the human sense of hearing) 1103 (axes: sound pressure versus frequency).